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DNA sequence and preparation of grass pollen allergen Phi p 4 by 

recombinant methods 



5 Background of the invention 

The present invention relates to the provision of the genetic sequence of 
the major grass pollen allergen Phi p 4. The invention also covers frag- 
ments, new combinations of partial sequences and point mutants having a 
hypoallergenic action. The recombinant DNA molecules and the derived 
10 polypeptides, fragments, new combinations of partial sequences and vari- 
ants can be utilised for the therapy of pollen-allergic diseases. The proteins 
prepared by recombinant methods can be employed for the in vitro and in 
vivo diagnosis of pollen allergies. 

15 Type 1 allergies are of importance worldwide. Up to 20% of the population 
in industrialised countries suffer from complaints such as allergic rhinitis, 
conjunctivitis or bronchial asthma. These allergies are caused by allergiens 
present in the air (aeroallergens) which are liberated from sources of vari- 
ous origin, such as plant pollen, mites, cats or dogs. Up to 40% of these 

20 type 1 allergy sufferers in turn exhibit specific IgE reactivity with grass 

pollen allergens (Freidhoff et al., 1986, J. Allergy Clin. Immunol. 78, 1190- 
2001). 

The substances which trigger type 1 allergy are proteins, glycoproteins or 
25 polypeptides. After uptake via the mucous membranes, these allergens 
react with the IgE molecules bonded to the surface of mast cells in sensi- 
tised individuals. If two IgE molecules are crosslinked to one another by an 
allergen, this results in the release of mediators (for example histamine, 
prostaglandins) and cytokines by the effector cell and thus in the corre- 
30 spending clinical symptoms. 
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A distinction is made between major and minor allergens depending on the 
relative frequency with which the individual allergen molecules react with 
the IgE antibodies of allergy sufferers. 

5 In the case of timothy grass (Phieum pratense). Phi p 1 (Petersen et al., 
1993. J. Allergy Clin. Immunol. 92: 789-796). Phi p 5 (Matthiesen and 
Lowenstein, 1991. Clin. Exp. Allergy 21: 297-307; Petersen et a!.. 1992, Int. 
Arch. Allergy Immunol. 98: 105-109). Phi p 6 (Petersen et al., 1995. Int. 
Arch. Allergy Immunol. 108, 49-54). Phi p 2/3 (Dolecek et al.. 1993. FEBS 

10 335 (3). 299-304). Phi p 4 (Haavik et al., 1985. Int. Arch. Allergy Appl. 

Immunol. 78: 260-268; Valenta et al., 1992, Int. Arch. Allergy Immunol. 97: 
287-294, Fischer et al., 1996, J. Allergy Clin. Immunol. 98: 189-198) and 
Phi p 13 (Suck et al., 2000, Clin. Exp. Allergy 30: 324-332; Suck et al., 

2000, Clin. Exp. Allergy 30: 1395-1402) have hitherto been identified as 
15 major allergens. 

Phi p 4 has been mentioned as a basic glycoprotein having a molecular 
weight of between 50 and 60 kDa (Haavik et al.. 1985. Int. Arch. Allergy 
Appl. Immunol. 78: 260-268). The Phi p 4 molecule is trypsin-resistant 
20 (Fischer et al., 1996. J. Allergy Clin. Immunol. 98: 189-198). and 70-88% of 
grass pollen allergy sufferers have IgE antibodies against this molecule 
(Valenta et al., 1993, Int. Arch. Allergy Immunol. 97: 287-294; Rossi et al.. 

2001. Allergy 56:1180-1185; Mari, 2003. Clin. Exp. Allergy 33:43-51). 
Homologous molecules have been described from related grass species 

25 (Suetal., 1991, Clin. Exp. Allergy 21: 449-455; Jaggi et al., 1989, Int. Arch. 
Allergy Appl. Immunol. 89: 342-348; Jaggi et al., 1989, J. Allergy Clin. 
Immunol. 83: 845-852; Leduc-Brodard et al., 1996, J. Allergy Clin. 
Immunol. 98: 1065-1072; 14-17). These homologous molecules of the 
Poaceae form allergen group 4. whose molecules have high immunological 

30 cross-reactivity with one another both with monoclonal mouse antibodies 
and with human IgE antibodies (Fahlbusch et al., 1993 Clin. Exp. Allergy 
23:51-60; Leduc-Brodard et al.. 1996. J. Allergy Clin. Immunol. 98:1065- 
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1072; Su et al., 1996, J. Allergy Clin. Immunol. 97:210; Fahlbusch et al.. 
1998, Clin. Exp. Allergy 28:799-807; Gavrovic-Jankulovic et al.. 2000. 
Invest. Allergol. Clin. Immunol. 10 (6): 361-367; Stumvoll et al. 2002. Biol. 
Chem. 383: 1383-1396; Grote et aL. 2002. Biol. Chem. 383: 1441-1445; 
5 Andersson and Lidholm, 2003, Int. Arch. Allergy Immunol. 130: 87-107; 
Marl. 2003. Clin. Exp. Allergy. 33(1): 43-51). 

In contrast to the above-mentioned major allergens of Phleum pretense 
(Phi p 1. Phi p 2/3. Phi 5a and 5b. Phi p 6 and Phi p 13). the primary 
structure of Phi p 4 has not yet been elucidated. Likewise, there is no com- 
10 plete sequence of molecules from group 4 from other grass species. 

The determination of the N-terminal amino acid sequence was hitherto un- 
successful. However, the causes of this are not known. Fischer et al. 
(J. Allergy Clin. Immunol., 1996; 98: 189-198) assume N-terminal blocking, 
15 but were able to purify an internal peptide after degradation with lysyl 
endopeptidase and to determine its sequence: IVALPXGMLK (SEQ ID 
NO 7). 

This peptide has homologies to peptide sequences in the ragweed aller- 
gens Amb a1 and Amb a2 and similarities to sequences in proteins from 

20 maize (Zm58.2). tomato (lat 59. lat 56) and tobacco (G10) (Fischer et al.. 

1996. J. Allergy Clin. Immunol. 98: 189-198). For Lolium perenne, peptide 
fragments having the following sequence have been described for the basic 
group 4 allergen: FLEPVLGLIFPAGV (SEQ ID NO 8) and GLIEFPAGV 
(SEQ ID NO 9) (Jaggi et al.. 1989, Int. Arch. Allergy Appl. Immunol. 89: 

25 342-348). 

Peptides have likewise been obtained from the group 4 allergen from 
Dactylus glomerata by enzymatic degradation and sequenced: 
DIYNYMEPYVSK (P15. SEQ ID NO 10), 
30 VDPTDYFGNEQ (P17. SEQ ID NO 1 1), 

ARTAWVDSGAQLGELSY (P20, SEQ ID NO 12) 
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and GVLFNIQYVNYWFAP (P22, SEQ ID NO 13) (Leduc-Brodard et al.. 
1996, J. Allergy Clin. Immunol. 98: 1065-1072). 

Peptides have also been obtained from the group 4 allergen of subtropical 

Bermuda grass {Cynodon dactylon) by proteolysis and sequenced: 
5 KTVKPLYIITP (S, SEQ ID NO 14). 

KQVERDFLTSLTKDIPQLYLKS (V49L. SEQ ID NO 15), 

7VKPLYIITPITAAMI (T33S, SEQ ID NO 16). 

LRKYGTAADNVIDAKWDAQGRLL (T35L. SEQ ID NO 17), 

KWQ7VAPALPDPNM (P2. SEQ ID NO 18), 
10 VTWIESVPYIPMGDK (V26L, SEQ ID NO 19), 

GTVRDLLXRTSNIKAFGKY (L25L, SEQ ID NO 20), 

TSNIKAFGKYKSDYVLEPIPKKS (T22L, SEQ ID NO 21), 

YRDLDLGVNQWG (P3, SEQ ID NO 22). 

SATPPTHRSGVLFNI (V20L, SEQ ID NO 23), 
15 and AAAALPTQVTRDIYAFMTPYVSKNPRQAYVNYRDLD (V14L. SEQ ID 

NO 24) (Llaw et al., 2001, Biochem. Biophys. Research Communication 

280: 738-743). 

However, these described peptide sequences for Phi p 4 and group 4 
20 allergens have hitherto not resulted in the elucidation of the complete 
primary structure of group 4 allergens. 

The object on which the present invention is based therefore comprised the 
provision of the complete DNA sequence of Phi p 4 and of a corresponding 
25 recombinant DNA on the basis of which the Phi p 4 allergen can be 

expressed as protein and made available for pharmacologically significant 
utilisation as such or in modified form. 



30 
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List of figures 

Figure 1: Internal DNA sequence (SEQ ID NO 25) of the Phi p 4 gene 
Amplicons obtained with genomic DNA were cloned with the degenerated 
5 primers No. 30 (sense) and No. 37 (antisense), both shown in italics, and 
sequenced. The sequence shown represents the consensus from 6 clones. 
The specific sense primer No. 82 created from this sequence is shown 
underlined. 

10 Figure 2: 3* end of the nucleic acid sequence (SEQ ID NO 26) of the Phi p 
4 gene 

Amplicons were obtained with the specific sense primer No. 82 (shown in 
italics) and an anchor primer in a 3'-RACE PGR with Phleum pratense 
cDNA and sequenced. The sequence shown represents the consensus 
15 from 3 sequencing processes and covers the 3' end of the Phi p 4 gene to 
the stop codon (double underlined). The sequence ranges employed for 
construction of the antisense primers No. 85 and No. 86 are shown under- 
lined. 

20 Figure 3: Localisation of the Phi p 4 peptides in the deduced amino acid 
sequence of the Phi p 4 allergen (SEQ ID NO 2) 

The peptides P1 - P6 (SEQ ID NO's 27-32) obtained from the amino acid 
sequencing of the purified and fragmented Phi p 4 allergen can unambigu- 
ously be assigned to the amino acid sequence of the Phi p 4 gene derived 
25 from the nucleic acid sequence. 

Figure 4: Determination of the identity of recombinant Phi p 4 (rPhI p 4) by 
means of monoclonal antibodies 5H1 (blot A) and 3C4 (blot B) specific for 
nPhI p 4 by Western blot. 
30 Track 1: E. co// total cell extract comprising rPhI p 4 fragment 1-200 

Track 2: E. co// total cell extract comprising rPhI p 4 fragment 185-500 
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Track 3: E. co// total cell extract comprising rPhI p 4 
Track 4: purified nPhI p 4 from Phleum pretense 

( < ): termination or degradation fragments of C-terminal rPhI p 4 frag- 
ment or rPhI p 4 entire molecule 

5 

Figure 5: Detemnination of the reactivity of recombinant Phi p 4 (rPhI p 4) 
using IgE from sera of grass pollen allergy sufferers by Western blot. 
Extracts of transformed E. co// cells which either express the complete Phi 
p 4 gene or the N-terminal fragment 1-200 or the C-terminal fragment 185- 

10 500 were separated in the SDS-PAGE and transferred to nitrocellulose 
membranes. The blot was incubated with sera from grass pollen-allergic 
donor A, B or C, and bound IgE was subsequently detected colorimetrically 
via an anti-human IgE antibody conjugated with alkaline phosphatase. 
Track 1: E co// total cell extract comprising rPhI p 4 fragment 1-200 

15 Track 2: E co// total cell extract comprising rPhI p 4 fragment 185-500 
Track 3: E co// total cell extract comprising rPhI p 4 
Track 4: purified nPhI p 4 from Phleum pretense 

The numbers used above and below for nucleotide or amino acid 
20 sequences "SEQ ID NO" relate to the sequence protocol attached to the 
description. 

Description of the invention 

25 

The present invention now provides for the first time the genetic sequence 
of the major grass pollen allergen Phi p 4, with three dominant sequences 
(SEQ ID NO 1. 3 and 5) arising from the single nucleotide polymorphisms 
(SNPs) found. 

30 The present invention therefore relates to a DNA molecule corresponding 
to a nucleotide sequence selected from a group consisting of SEQ ID NO 
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1, SEQ ID NO 3 and SEQ ID NO 5 or a DNA molecule corresponding to a 
nucleotide sequence which encodes for the major allergen Phi p 4 from 
Phleum pretense. 

5 The invention also covers fragments, new combinations of partial 
sequences and point mutants having a hypoallergenic action. 

The invention therefore furthermore relates to corresponding partial 
sequences, a combination of partial sequences or exchange, elimination or 
10 addition mutants which encode for an immunomodulatory, T-cell-reactive 
fragment of a group 4 allergen of the Poaceae. 

In addition to the group 4 allergens of the other grass species, the group 13 
allergens are also of interest in connection with the present invention since 

15 they exhibit a very similar molecular weight to the group 4 allergens in the 
SDS-PAGE and are difficult to separate by biochemical techniques (Suck 
et al., 2000, Clin, Exp. Allergy 30: 324-332, Suck et al., 2000, Clin. Exp. 
Allergy 30: 1395-1402). With the aid of the protein and DNA sequence 
according to the invention which is now available for the first time, however, 

20 it can unambiguously be shown that groups 4 and 13 have significantly 
different amino acid sequences. 

With knowledge of the DNA sequence of naturally occurring allergens, it is 
now possible to prepare these allergens as recombinant proteins which can 
25 be used in the diagnosis and therapy of allergic diseases (Scheiner and 
Kraft, 1995. Allergy 50: 384-391). 

A classical approach to effective therapeutic treatment of allergies is spe- 
cific immunotherapy or hyposensitisation (Fiebig, 1995, Allergo J. 4 (6): 
30 336-339, Bousquet et al., 1998, J. Allergy Clin. Immunol. 102(4): 558-562). 
In this method, the patient is injected subcutaneously with natural allergen 
extracts in increasing doses. However, there is a risk in this method of 
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allergic reactions or even anaphylactic shock. In order to minimise these 
risks, innovative preparations in the form of allergoids are being employed. 
These are chemically modified allergen extracts which have significantly 
reduced IgE reactivity, but identical T-cell reactivity compared with the un- 

5 treated extract (Fiebig, 1995. Allergo J. 4 (7): 377-382). 

Even more substantial therapy optimisation would be possible with aller- 
gens prepared by recombinant methods. Defined cocktails of high-purity 
allergens prepared by recombinant methods, optionally matched to the 
individual sensitisation patterns of the patients, could replace extracts from 

10 natural allergen sources since these, in addition to the various allergens, 
contain a relatively large number of immunogenic, but non-allergenic sec- 
ondary proteins. 

Realistic perspectives which may result in reliable hyposensitisation with 
expression products are offered by specifically mutated recombinant aller- 
15 gens in which IgE epitopes are specifically deleted without impairing the 
T-cell epitopes which are essential for therapy (Schramm et al., 1999, J. 
Immunol. 162: 2406-2414). 

A further possibility for therapeutic influencing of the disturbed TH-cell 
20 equilibrium in allergy sufferers is immunotherapeutic DNA vaccination. This 
involves treatment with expressable DNA which encodes for the relevant 
allergens. Initial experimental evidence of allergen-specific influencing of 
the immune response has been furnished in rodents by Injection of aller- 
gen-encoding DNA (Hsu et al., 1996, Nature Medicine 2 (5): 540-544). 

25 

The present invention therefore also relates to a DNA molecule described 
above or below or a corresponding recombinant expression vector as 
medicament. 



30 



The corresponding proteins prepared by recombinant methods can be 
employed for the therapy and for the in vitro and in vivo diagnosis of pollen 
allergies. 
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For preparation of the recombinant allergen, the cloned nucleic acid is 
ligated to an expression vector, and this construct is expressed in a suit- 
able host organism. After biochemical purification, this recombinant aller- 
gen is available for the detection of IgE antibodies by established methods. 

5 

The present invention therefore furthermore relates to a recombinant 
expression vector comprising a DNA molecule described above or below, 
functionally linked to an expression control sequence and a host organism 
transformed with the said DNA molecule or the said expression vector. 

10 

The Invention likewise relates to the use of at least one DNA molecule 
described above or at least one expression vector described above for the 
preparation of a medicament for immunotherapeutic DNA vaccination of 
patients having allergies in the triggering of which group 4 allergens of the 
1 5 Poaceae are involved and/or for the prevention of such allergies. 

As already stated, the invention can be used as an essential component in 
a recombinant allergen- or nucleic acid-containing preparation for specific 
immunotherapy. There are a number of possibilities here. Firstly, the pro- 

20 tein with an unchanged primary structure may be a constituent of the prep- 
aration. Secondly, through specific deletion of IgE epitopes of the entire 
molecule or the preparation of individual fragments which encode for T-cell 
epitopes, a hypoallergenic (allergoidal) form can be used in accordance 
with the invention for therapy in order to prevent undesired side effects. 

25 Finally, the nucleic acid per se, if ligated with a eukaryotic expression 

vector, gives a preparation which on direct application modifies the allergic 
immune state in the therapeutic sense. 

The invention thus relates to recombinant DNA molecules corresponding to 
30 SEQ ID NO 1, 3 or 5, where the nucleotide sequence of positions 1-69 has 
been derived from the amino acid sequence of the Phi p 4 N-terminus. 
Codons which frequently occur in E. co// were used here. From position 70, 
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the DNA sequence corresponds to that which has been identified in 
genomic and cDNA of Phleum pretense. 

The present invention therefore furthermore relates to a DNA molecule 
comprising a nucleotide sequence according to SEQ ID NO 1. SEQ ID 
5 NO 3 or SEQ ID NO 5, commencing with position 70. which encodes for a 
polypeptide having the properties of the major allergen Phi p 4 from 
Phleum pratense. 

Furthermore, the present invention relates to the polypeptides encoded by 
10 one or more of the above-described DNA molecules, preferably in their 
property as medicament. 

These are, in particular, polypeptides according to SEQ ID NO 2, SEQ ID 
NO 4 or SEQ ID NO 6, where amino acid positions 1-33 have been deter- 
mined by N-terminal amino acid sequencing of the isolated natural Phi p 4 
15 allergen. Positions 24-500 were derived from the DNA sequence according 
to SEQ ID NO 1. 3 and 5. Variable amino acids at positions 6, 7, 8 and 9 
originate from the N-terminal protein sequencing of various preparations of 
natural Phi p 4 (Table 1). 

Accordingly, the invention also relates to a process for the preparation of 
20 polypeptides of this type by cultivation of a host organism according to 

Claim 1 1 and isolation of the corresponding polypeptide from the culture. 

The invention likewise relates to the use of at least one polypeptide 
described above for the preparation of a medicament for the diagnosis 
25 and/or treatment of allergies in the triggering of which group 4 allergens of 
the Poaceae are involved and for the prevention of such allergies. 

These polypeptides or proteins according to the invention which act as 
allergens for humans are present in the pollen grains of Phleum pratense. 
30 The pollen grains of the other Poaceae species, such as, for example, 
Lolium perenne, Dactylis glomerate, Poa pretensis, Cynodon dectylon, 
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Holcus lanatus, inter alia, contain homologous allergen molecules (group 4 
allergens). 

The homology of these molecules has been demonstrated through their 
immunological cross-reactivity both with murine monoclonal antibodies and 
5 also with human IgE antibodies. 

Consequently, the invention also relates to sequences which are homolo- 
gous to the Phi p 4 DNA sequence and corresponding DNA molecules of 
group 4 allergens from other Poaceae, such as, for example, Lolium per- 
10 enne, Dactylis glomerata, Poa pratensis, Cynodon dactylon, Holcus lana- 
tus, Triticum aestivum and Hordeum vulgare, which, owing to the sequence 
homology which exists, hybridise with Phi p 4 DNA under stringent condi- 
tions or have immunological cross-reactivity with respect to Phi p 4. 

15 

The following procedure was followed in the determination of the protein 
and DNA sequence of Phi p 4: 

The natural allergen Phi p 4 was purified and isolated by described meth- 
20 ods (Fahlbusch et al. 1998. Clin. Exp. Allergy 28: 799-807. Suck et al. 
2000. Clin. Exp. Allergy 30: 1395-1402). The micropurification and the 
removal of traces of the group 13 allergen was carried out by the method 
described by Suck et al. (2000, Clin. Exp. Allergy 30: 1395-1402). 
The N-terminal amino acid sequence of this Phi p 4 isolated from Phleum 
25 pratense was determined by means of Edman degradation. The N-terminal 
sequences (P1a - f) shown in Table 1 were determined with various 
batches of Phi p 4. The consensus sequence for the first 15 positions is 
regarded as being the following sequence: YFPP'P^AAKEDFLGXL (SEQ ID 
NO 33). Position 14 could not be determined; it is probably occupied by 
30 cysteine. The different amino acids in positions 6. 7. 8 and 9 in the different 
batches indicate variations in the sense of isoforms. Positions 4 and 5 are 
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occupied by hydroxyproline (P'), which was unambiguously determined by 
specific analysis in the analyses of preparations p1-a and -b. 

Treatment of the SDS-denatured Phi p 4 with the endopeptidase Glu-C 
5 (Promega, Heidelberg, Germany) gave various peptides. The amino acid 
sequences shown in Table 1 were determined for two peptides (P2 and 
P3). 2 peptides (P4 and P5) were purified by cleavage using the endo- 
peptidase Lys-C (Roche, Mannheim, Germany) and sequenced (Table 1). 
A further peptide (P6) was isolated by CNBr cleavage and the amino acid 
10 sequence was determined (Table 1). 

The amino acid sequences of the N-terminal sequence and the internal 
peptides 2 and 6 were used as the basis for the construction of degener- 
ated primers. Amplicons were prepared with the sense primer No. 30 and 
15 the antisense primer No. 37 (Table 2) using genomic DNA from Phleum 

pretense. The clones obtained from these amplicons were sequenced (Fig. 

1) and used for the construction of the specific sense primer No. 82 (Table 

2) . Using a cDNA prepared from the representative mRNA population from 
Phleum pretense pollen and the specific sense primer No. 82 according to 

20 the invention and the anchor primer AUAP (Life Technologies, Karlsruhe, 
Germany), a PGR was carried out under stringent conditions. This approxi- 
mately 450 kb amplicon was sequenced and the missing sequence as far 
as the 3' end of the Phi p 4 gene was thus identified (Fig. 2). Based on this 
C-terminal Phi p 4 sequence determined in accordance with the invention, 

25 the specific antisense primers No. 85 and No. 86 were constructed 

(Table 2). Based on the N-terminal amino acid sequence of the Phi p 4 
peptide PI -a (Table 1), the degenerated sense primer No. 29, derived from 
the DNA encoding for amino acid positions 24-33 (LYAKSSPAYP (SEQ ID 
NO 34)), was constructed. 



30 
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A PGR was carried out with primers No. 29 and No. 86 using genomic 
Phleum pratense DNA. This PGR product was employed as the basis for a 
second PGR (nested PGR) with primers No. 29 and No. 85. The amplicons 
were inserted into the vector pGEM T-easy (Promega, Heidelberg, Ger- 

5 many), cloned and sequenced. This sequence begins at position 24 cal- 
culated from the N-terminus or position 70 of the DNA sequence in accor- 
dance with SEQ ID NO 1, 3 or 5 and extends to primer No. 85 (position 
1402 in SEQ ID NO 1, 3 or 5), which is localised in the already determined 
G-terminal section of the Phi p 4 gene. Using these data, the complete 

10 amino acid sequence of the Phi p 4 molecule can be constructed from the 
first 33 amino acid positions, determined by protein sequencing, and the 
deduced amino acid sequence (477 positions), which can be derived from 
the clones prepared with primers No. 29/No. 85 and No. 82/anchor primer. 
The two clones overlap in 197 positions of their nucleotide sequence. The 

15 peptide encoded by clone No. 29/No. 85 overlaps in 10 amino acid posi- 
tions with the N-terminal sequence (positions 1-33), determined by direct 
amino acid sequencing, of Phi p 4, where the amino acids determined by 
the two methods correspond. 

20 The amino acid sequence of Phi p 4 based on the directly determined N- 
terminal amino acids and the deduced amino acid sequence corresponds 
to the sequences listed in the sequence protocol under SEQ ID NO 2, 4 
and 6. 

25 PGR products were prepared with the specific sense primer No. 88 (Table 
2) and the specific antisense primer No. 86 both using genomic and using 
cDNA from Phleum pratense and sequenced directly. 
This enables PGR errors to be excluded and genetic variations (single 
nucleotide polymorphisms) to be discovered. 

30 The single nucleotide polymorphisms found for the DNA sequence SEQ ID 
NO 1 are shown in Table 3. Some of these single nucleotide polymor- 
phisms result in modified amino acids. These are shown in Table 4. Fur- 



wo 2004/000881 



-14- 



PCT/EP2003/006092 



thermore, DNA clones which result in deviating amino acids with respect to 
the dominant sequences SEQ ID NO 2, 4 and 6 were sequenced (Table 5). 
These amino acid variations are to be regarded as isoforms of the Phi p 4 
molecule. The existence of such isoforms to be be expected owing to the 

5 heterogeneous isoelectric behaviour of natural Phi p 4. All pollen allergens 
known hitherto have such isoforms. The fact that the DNA fragment deter- 
mined with primers No. 29 and 86 actually encodes for a protein which is 
identical with the natural Phi p 4 allergen can also be demonstrated, inter 
alia, by the fact that homologous peptide sequences in the deduced amino 

10 acid sequence of the recombinant Phi p 4 molecule according to the inven- 
tion are found (Fig. 3) for the identified internal peptides P3, P4 and P5 
(Table 1) of natural Phi p 4. The Phi p 4 amino acid sequence described 
shows that it is a basic molecule having a calculated isoelectric point of 
8.99 (SEQ ID NO 2), 8.80 (SEQ ID NO 4) or 9.17 (SEQ ID NO 6), consist- 

15 ing of 500 amino acids. The quantitative amino acid composition is shown 
in Table 6. The calculated molecular weight of recombinant Phi p 4 is 
55.762 (SEQ ID NO 2), 55.734 (SEQ ID NO 4) or 55.624 (SEQ ID NO 6) 
daltons. This calculated molecular weight agrees very well with the molecu- 
lar weight of natural Phi p 4 of 55 kDa determined by SDS-PAGE (Fahl- 

20 busch et al., 1998. Clin. Exp. Allergy 28: 799 -807 and Suck et al.. 2000. 
Clin. Exp. Allergy 30: 1395-1402). 

Molecular weights of between 50 and 60 kDa have also been described for 
the group 4 allergens of related grass species (Su et al., 1991, Clin, Exp. 
25 Allergy 21: 449-455; Jaggi et al., 1989, Int. Arch. Allergy Appl. Immunol. 89: 
342-348; Jaggi et al., 1989, J. Allergy Clin. Immunol. 83: 845-852; Leduc- 
Brodard et al., 1996. J. Allergy Clin. Immunol. 98: 1065-1072; 14 - 17). 

For the preparation of the recombinant Phi p 4 protein, the DNA sequence 
30 according to SEQ ID NO 1 , 3 and/or 5 encoding for Phi p 4 was inserted 
into expression vectors (for example pProEx. pXCro, pSE 380). For the 
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N-terminal amino acids known from protein sequencing, E. co// optimised 
codons were used. 

After transformation into E. coli, expression and purification of the recombi- 
5 nant Phi p 4 by various separation methods, the resultant protein was 
subjected to a refolding process. 

This rPhI p 4 protein obtained in this way gives a single band in the SDS- 
PAGE which covers the same molecular weight range as natural Phi p 4. 
The immunological reactivity of rPhI p 4 has been demonstrated by reaction 

10 with the murine monoclonal antibodies 5H1 and 3C4. which had been 

induced using natural Phi p 4 and cross-react with the homologous proteins 
(group 4) of the Poaceae (Fahlbusch et al., 1998, Clin. Exp. Allergy 28:799- 
807; Gavrovic-Jankulovic et a!., 2000, Invest. Allergol. Clin. Immunol. 10 
(6): 361-367) (Fig. 4). rPhl p 4 reacts with IgE antibodies of allergy sufferers 

15 which have demonstrated IgE reactivity with natural Phi p 4. This IgE 

reactivity and thus the action as allergen has been demonstrated both in 
the dot test, Western blot and also after adsorption of the allergen on 
polystyrene microtitre plates. Detection by Western blot is shown in Figure 
5. On reaction of rPhI p 4 with basophiles of allergen group 4-reactive grass 

20 pollen allergy sufferers, these are stimulated to increased expression of the 
activation marker CD 203c. This basophile activation by rPhI p 4 clearly 
shows that this molecule also acts functionally as an allergen. 
This rPhI p 4 allergen can thus be employed for the highly specific diagno- 
sis of grass pollen allergy sufferers. This diagnosis can be carried out in 

25 vitro by detection of specific antibodies (IgE, lgG1-4, IgA) and reaction with 
IgE-loaded effector cells (for example basophiles from the blood) or in vivo 
by skin test reactions and provocation at the reaction organ. 

The reaction of rPhI p 4 with T-lymphocytes of grass pollen allergy sufferers 
30 has been detected by allergen-specific stimulation of the T-lymphocytes for 
proliferation and cytokine synthesis both with T-cells in freshly prepared 
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blood lymphocytes and on established nPhI p 4-reactive T-cell lines and 
clones. 

Based on the rPhI p 4 DNA sequence described, partial sequences 
5 encoding for peptides having from 50 to 350 amino acids were cloned into 
expression vectors. These partial sequences cover sequentially the com- 
plete sequence of rPhI p 4, with overlaps of at least 12 amino acids occur- 
ring. The expressed peptides correspond to Phi p 4 fragments. These Phi p 
4 fragments do not react individually or as a mixture with the IgE antibodies 
10 of allergy sufferers or only do so to a small extent, so that they can be 

classified as hypoallergenic. In contrast, the mixture of these fragments is 
capable, in the same way as complete recombinant or natural Phi p 4, of 
stimulating T-lymphocytes of grass pollen allergy sufferers having Phi p 4 
reactivity. 

15 Figure 4 shows as an example the characterisation of two such Phi p 4 

fragments corresponding to amino acids 1-200 and 185-500 by binding to 
Phi p 4-specific monoclonal mouse antibodies. The C-terminal fragment 
185-500 reacts only with monoclonal antibody 5H1, while the N-terminal 
fragment 1-200 clearly reacts with monoclonal antibody 3C4. It can be seen 

20 from Figure 5 that fragment 185-500 reacts less strongly with the IgE from 
the sera of allergy sufferers B and C. i.e. is less allergenic than fragment 
1-200, which has reduced IgE reactivity (hypoallergeneity). at least to 
patient serum C. 

The present invention therefore also relates to a DNA molecule described 
25 above or below, encoding for a fragment 1-200, with amino acids 1-200 of 
Phi p 4, 

and a DNA molecule encoding for a fragment 285-500. with amino acids 
285-500 of Phi p 4. 

30 The triplets encoding for the cysteines were modified by site-specific muta- 
genesis in such a way that they encode for other amino acids, preferably 
serine. Both variants in which individual cysteines have been replaced and 
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those in which various combinations of 2 cysteine radicals or all 5 cysteines 
have been modified have been prepared. The expressed proteins of these 
cysteine point mutants have highly reduced or zero reactivity with IgE 
antibodies of allergy sufferers, but react with the T-lymphocytes of these 
5 patients. The present invention therefore furthermore relates to a DNA 
molecule described above or below in which one, more or all of the 
cysteine radicals of the corresponding polypeptide have been replaced by 
another amino acid by site-specific mutagenesis. 

10 The immunomodulatory activity of the hypoallergenic fragments which cor- 
respond to polypeptides having T-cell epitopes and those of the hypoaller- 
genic point mutants (for example cysteine polymorphisms) has been dem- 
onstrated by reaction thereof with T-cells of grass pollen allergy sufferers. 

15 Such hypoallergenic fragments or point mutants of the cysteines can be 
employed as preparations for the hyposensitisation of allergy sufferers 
since they react with equal effectiveness with the T-cells. but, owing to the 
reduced or entirely absent IgE reactivity, result in reduced IgE-medlated 
side effects. 

20 

If the nucleic acids encoding for the hypoallergenic Phi p 4 variants or the 
unmodified DNA encoding for Phi p 4 are ligated with a human expression 
vector, these constructs can likewise be used as preparations for immuno- 
therapy (DNA vaccination). 

25 

Finally, the present invention relates to pharmaceutical compositions com- 
prising at least one DNA molecule described above or at least one expres- 
sion vector described above and optionally further active ingredients and/or 
adjuvants for immunotherapeutic DNA vaccination of patients having 
30 allergies in the triggering of which group 4 allergens of the Poaceae are 
involved and/or for the prevention of such allergies. 
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A further group of pharmaceutical compositions according to the invention 
comprises, instead of the DNA, at least one polypeptide described above 
and is suitable for the diagnosis and/or treatment of the said allergies. 

5 Pharmaceutical compositions in the sense of the present invention 

comprise, as active ingredients, a polypeptide according to the invention or 
an expression vector and/or respective pharmaceutically usable derivatives 
thereof, including mixtures thereof in all ratios. The active ingredients 
according to the invention can be brought here into a suitable dosage form 

10 together with at least one solid, liquid and/or semi-liquid excipient or adju- 
vant and optionally in combination with one or more further active ingredi- 
ents. 

Particularly suitable adjuvants are immunostimulatory DNA or oligonucleo- 
tides having CpG motives. 

15 

These compositions can be used as therapeutic agents or diagnostic 
agents in human or veterinary medicine. Suitable excipients are organic or 
inorganic substances which are suitable for parenteral administration and 
do not adversely affect the action of the active ingredient according to the 

20 invention. Particularly suitable for parenteral administration are solutions, 
preferably oil-based or aqueous solutions, furthermore suspensions, emul- 
sions or implants. The active ingredient according to the invention may also 
be lyophilised and the resultant lyophilisates used, for example, for the 
preparation of injection preparations. The compositions indicated may be 

25 sterilised and/or comprise adjuvants, such as lubricants, preservatives, 
stabilisers and/or wetting agents, emulsifiers, salts for modifying the 
osmotic pressure, buffer substances and/or a plurality of further active 
ingredients. 

Furthermore, sustained-release preparations can be obtained by corres- 
30 ponding formulation of the active ingredient according to the invention. 
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The invention thus also serves for improving in vitro diagnosis as part of 
allergen component-triggering identification of the patient-specific sensitisa- 
tion spectrum. The invention likewise serves for the preparation of signifi- 
cantly improved preparations for the specific immunotherapy of grass 
5 pollen allergies. 



Table 1 Amino acid sequence of Phi p 4 peptides 



10 



15 



Preparation 


Peptide 
batch 


SEQ 

ID 

NO 


Amino acids 

1 6 11 16 21 26 31 


Intact Phi p 4 


P1-a 


35 


YFPP'P' AAKED FLGXL VKEIP PRLLY AKSSP AYP 


P1-b 


36 


YFPP'P' AAKED FLGXL VKE-P PRLLY AKSSP 




P1-C 


37 


YFPXX AAKED FLGXL 




P1-d 


38 


YFPXX AKKED FLGXL 




P1-e 


39 


YFPXX AAKDD FLGXL 




P1-f 


40 


YFPXX LANED F 


Glu-C 


P2 


41 


SATPF XHRKG VLFNI QYV 


fragments 


P3 


42 


GLXYR XLXPE 


Lys-C 


P4 


43 


KXMGD DHFXA VR 


fragments 


P5 


44 


APEGA VDI 1 


CNBr 


P6 


45 


MEPYV SINPV QAYAN Y 


fragment 









20 

Table 2 Degenerated and specific sense and antisense primers 

constructed on the basis of Phi p 4 peptide sequences 
and DNA sequences 



25 



30 



Primer 
No. 


Peptide/ 
DNA 


Sense/ 
anti- 
sense 


SEQ 

ID 

NO 


Nucleotide sequence 


29 


Phi p 4-P1 


s 


46 


YTN TAY GCN AAR WSN WSN 
CCN GCN TAY CC 


30 


Phi p 4-P2 


s 


47 


CAY MGN AAR GGN GTN YTN TTY 

AAY ATM C 


37 


Phi p 4-P6 


as 


48 


TAR TTN GCR TAN GCY TGN ACN 
GGRTT 


82 


Phi p 4-DNA-NYW 


s 


49 


ACT ACT GGT TCG CCC CGG GAG 
CC 


85 


Phi p 4-DNA-GLV 


as 


50 


TGA AGT ATT TCT GGC CCC ACA 

CCA AAC C 


86 


Phi p 4-DNA-QRL 


as 


51 


CCC TTG GTG ATG GCG AGC CTC 
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TGG 


88 


Phi p 4-DNA-PSV 


s 


52 


CTC AGT CCT GGG GCA GAC CAT 
CC 



The nucleotide sequences of primers 82, 85, 86 and 88 is shown in the 
usual 4-letter code. In the case of primers 29, 30 and 37. the lUPAC-lUB 
DNA code Is used; the letter 'N' here stands for inosine. 



Table 3 Detected single nucleotide polymorphisms 



10 



25 



30 



Position in 


Nucleotide according to 


Detected SNPs 


sequence 


SEQ ID NO 1 




85 


T 


A 


130 


C 


A 


159 


G 


A 


160 


A 


C 


169 


G 


A 


185 


C 


T 


186 


C 


A 


222 


G 


C 


226 


G 


A 


227 


G 


C 


228 


T 


C 


237 


C 


T 


273 


C 


T 


285 


c 


T 


286 


c 


T 


298 


G 


A 


299 


A 


C 


303 


C 


T 


309 


C 


G 


318 


T 


C 


320 


G 


A 


333 


C 


G 


348 


G 


C 


369 


C 


G 


409 


C 


T 


411 


C 


T 


420 


T 


C 


421 


A 


C 


423 


A 


c 


424 


G 


A 
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10 



15 



425 


T 


c 


456 


c 


G 


462 


c 


A 


522 


G 


c 


525 


c 


G 


567 


G 


A 


618 


c 


T 


655 


A 


c 


657 


G 


A 


662 


G 


A 


VJ w 


c 


T 


684 


G 


c 


w C7 w 


c 


A 


6Q1 


G 


A 




G 


A 


703 


c 


T A 


710 


A 


c 


71 1 


G 


A 


713 

f 1 


c 


T 


743 


G 


A 


7*^0 


G 


A 


768 


c 


T 


773 
f 1 o 


A 


c 


7Q0 




A 


7QR 


G 


c 


R01 


G 


A 


804 


c 


G 




c 


A 


834 


G 


c 


844 




A 




A 


T 




A 


G 


R7Q 


G 


c 




G 


c 


QOO 


G 


C A 


Q1R 


G 


A 


QR1 


A 


G 


QR9 


A 


c 


964 


A 


C 


987 


G 


C 


994 


A 


T 


1020 


G 


A 


1023 


G 


C 


1036 


G 


C 


1040 


C 


T 


1041 


G 


C 



20 



25 



30 
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15 



20 



1047 


c 


A 


1051 


A 


G 


1052 


G 


A C 


1053 


G 


ACT 


1056 


G 


c 


1069 


T 


c 


1073 


G 


A 


1084 


c 


G 


1086 


G 


c 


1090 


c 

V—/ 


T 
1 


1098 


G 


c 


1 151 


G 


c 


1 152 


G 


c 


1 155 


G 


c 


1 161 


G 


c 


1 185 


c 


G 


1229 


G 


c 


1233 


G 


c 


1239 


A 


c 


1240 


T 
1 


c 


1242 


G 


c 


12*57 


G 


c 


1266 


c 


T 
1 


1269 


c 


J 


1278 


A 


C G 


1305 


c 


G 


1308 


c 


T 
1 


1 31 1 




A 


1 335 


G 


c 


1350 


G 


c 


1 357 


T 


A 


1359 


A 


G 


1370 


G 


c 


1377 


T 
1 


c 


1 378 


T 
1 


A 


1379 


T 
1 


A 


1383 


G 


c 


1 398 


c 


T 
1 


1411 


T 


c 


1414 


C 


G 


1425 


C 


A 


1428 


C 


T 


1443 


G 


C 


1449 


C 


T 


1464 


G 


A 


1485 


G 


A 
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1498 



Table 4 Amino acid exclianges as a consequence of single 
nucleotide polymorphisms 



r OolllUll 111 


/AlillllU aOlU dUOwlUIII^ WJ 




Qon 1 1 o n 


QFO ID NO 7 




Q 


A 


1 


7 


A 


K 


Q 
O 


K 
r\ 


N 


Q 


F 


n 

L./ 


9Q 


c 
o 


T 
i 




1 

1 


1 


^7 

O f 


V 


1 
1 




A 


V 


/ o 




T N 55 

1 , IN, O 


1 uu 


P 


T 
1 


1 U r 


c 
o 


N 
IN 


1 O / 


u 

n 


Y 


1 H 1 


T 
1 


p 


149 


V 


A T 

/A, 1 


1 RQ 


T 
1 


K 


91 Q 


fx 


o 


991 


p 


fx 


997 


P 

r 


1 


9*^1 


V 


1 
1 




P 


T S 


237 


K 


T 


238 


A 


V 


248 


R 


K 


258 


D 


A 


264 


V 


1 


270 


T 


K 


282 


Q 


K 


287 


M 


L 


289 


S 


G 


299 


A 


P 


321 


N 


A 


322 


1 


L 


332 


T 


S 


346 


E 


Q 


347 


P 


L 



10 



15 



20 



25 



30 
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351 


R 


E T 


357 


F 


L 


358 


s 


N 


362 


L 


V 


364 

w w~ 


p 


s 


384 


W 


s 


410 


G 


A 


419 


E 


D 


456 


F 


Y 


457 


S 


A, N 


460 


L 


K 


468 


K 


M 


472 


Q 


E 


498 


K 


Q 



Table 5 Deviating amino acid positions in individual recombi- 
nant Phi p 4 clones compared with SEQ ID NO 2 



15 



25 



Example 


Deviating positions* 


Clone 1 


L54, 157, V62, S76. T100, N107, Y137, P141, T142, K189, 
Q219, K221, L227, 1231, S235, T237, V238, K248, A258, 1264, 
K270, K282, L287, P299, A321, L322, S332, Q346, P347. 
T351, L357, N358. V362. S384, A410, D419. Y456, A457, 
K460. E472 


Clone 2 


L54. 157. V62. T76, T100, N107, Y137, P141, T142, K189, 
Q219, K221, 1231, S235, T237, V238, K248, A258, 1264, K270. 
K282, L287, P299, A321, L322, S332, Q346, P347, T351, 
L357, N358, V362, S384, A410, 041 9, Y456, A457, K460, 
E472 


Clone 3 


P141, K282, L287, P299, L347, E351 


Clone 4 


G289, A410, D419, Y456, A457, K460. E472 


Clone 5 


L347, E351, S384. A410, D419, Y456, A457, K460. E472 


Clone 6 


N107, Y137. P141, T142, K189, Q219. K221, 1231, S235. 
T237, V238, K248, A258, 1264, K270, K282, L287, P299, A321, 
L322, S332. Q346, P347, T351. L357, N358, V362, S384, 
A410. D419. Y456. A457, K460 


Clone 7 


K248, A258, 1264, K270, K282, L287, P299, A321, L322, S332, 
Q346, P347, T351, L357. N358, V362, S384 


Clone 8 


Q219, K221, 1231, S235, T237, V238, K248, A258, 1264, K270, 
K282, L287. P299. E351 


Clone 9 


M231, T246. A251, C263, G289, L307, L309. E334 


Clone 10 


Q219, K221, 1231, S235, T237, M238. V242. V246, K248, 
A258. 1264, K270, K282, L287, P299. A321. L322, S332, Q346, 
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P347, T351, N358, V362. S384. insertion of GA between 
positions 407 and 408. N452, Y456, A457, K460, E472 


Clone 1 1 


Insertion of GA between positions 407 and 408 



*[Amino acid according to SEQ ID NO 2 / position in sequence / deviating 
amino acid] 

5 



Table 6 Amino acid composition of Phi p 4 



15 



20 



Amino acids 


Number 


% by weight 


Charged 


138/138/138 


33.89/33.86/33.93 


Acid 


45/46/43 


9.82/10.05/9.38 


Basic 


54/53/55 


13.67/13.39/13.78 


Polar 


120/119/124 


24.88/24.71/25.89 


Hydrophobic 


180/180/180 


35.64/35.66/35.43 


A Ala 


40/40/41 


5.10/5.10/5.24 


C Cvs 


5/5/5 


0.92/0.93/0.93 


D Asp 


24/24/24 


4.95/4.96/4.97 


EGlu 


21/22/19 


4.86/5.10/4.41 


F Phe 


24/24/22 


6.33/6.34/5.82 


G Gly 


42/42/40 


4.30/4.30/4.10 


H His 


10/10/9 


2.46/2.46/2.22 


1 lie 


29/29/30 


5.88/5.89/6.10 


KLys 


29/29/33 


6.67/6.67/7.60 


L Leu 


33/33/35 


6.70/6.70/7.12 


M Met 


11/11/10 


2.59/2.59/2.36 


N Asn 


22/22/23 


4.50/4.50/4.72 


P Pro* 


38/39/39 


6.62/6.80/6.81 


QGIn 


15/15/15 


3.45/3.45/3.46 


RArg 


25/24/22 


7.00/6.73/6.18 


S Ser 


32/32/33 


5.00/5.00/5.17 


TThr 


22/21/22 


3.99/3.81/4.00 


VVal 


41/41/40 


7.29/7.29/7.13 


WTrp 


13/13/12 


4.34/4.34/4.02 


YTyr 


24/24/26 


7.02/7.03/7.63 



* including hydroxyproline 

The values are given for the three dominant sequences in the order SEQ 
ID NO 2 / SEQ ID NO 4 / SEQ ID NO 6. 



